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A SYSTEM AND METHOD FOR 
IMPROVING THE PRECISION OF LOCALIZATION ESTIMATES 

10 BACKGROUND 
Technical Field: 

The invention is related to object localization and tracking within a 
prescribed search area, and in particular, to a system and process for improving 
the precision of localization estimates generated by use of a receiving array, 
such as, for example, microphone arrays, directional antenna arrays, radar 
receiver arrays, etc., by providing cluster-based statistical post-processing of 
initial localization measurements or estimates. 

Background Art: 

Localization and tracking of objects within prescribed regions is an 
important element of many systems. For example, a number of conventional 
25 audio conferencing applications use microphone arrays with conventional sound 
source localization (SSL) processing (i.e., time-delay estimates, beamsteering, 
etc.) to enable the speech or sound of particular individuals to be effectively 
isolated and processed as desired. Similar techniques have used arrays of 
directional antennas for locating radio sources for a number of applications, such 
30 as, for example, for determining which node or nodes are to be used by particular 
subscribers within in a wireless computer network. Still other similar techniques 
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have been used for tracking objects using radar or laser receiver arrays. In 
general, such techniques are well known to those skilled in the art. 

For example, conventional microphone arrays typically include an 
5 arrangement of microphones in some predetermined layout. These microphones 
are generally used to capture sound waves from various directions and 
originating from different points in space. One of a number of conventional 
techniques is then used to perform SSL. In general, these SSL techniques fall 
into two categories, including those based on time delay estimates (TDE), and 

10 those based on beamsteering. Finding the direction to a sound source plays an 
important role in spatial filtering, i.e. pointing a beam to the sound source and 
suppressing any noises coming from other directions. In some cases the 
direction to the sound source is used for speaker tracking and post-processing of 
recorded audio signals. In the context of a video conferencing system, speaker 

15 tracking is often used for dynamically directing a video camera toward the person 
speaking. 

In general, most sound source localization systems process the signals 
from a microphone array by first preprocessing each signal from each 
20 microphone of the array. This preprocessing typically includes packaging the 
signal in frames, performing noise suppression, and classifying individual frames 
for determining whether particular frames will be processed or rejected for the 
purposes of determining the location of the sound source. 

25 Once the preprocessing is complete, the actual sound source localization 

typically involves the use of conventional SSL techniques including, for example, 
TDE or beamsteering techniques, to provide either initial direction estimates or a 
probability distribution function (PDF) for indicating where a sound source is 
located. This location can be defined in terms of one-dimensional localization 

30 (i.e., the angle along which the sound source is located in a plane), two- 
dimensional localization (i.e., two angles, direction and elevation, for defining a 
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vector representing the direction of the sound source in a three dimensional 
space), and full three-dimensional localization (i.e., direction, elevation and 
distance, for locating a point in a three-dimensional space at which the sound 
source is located). In general, whichever SSL technique is used, the goal is 
5 typically to provide robustness to reverberation, the ability to distinguish multiple 
sound sources, and high location precision in potentially noisy environments. 

Once an indicator of the sound source location has been computed, a 
post-processing phase is often implemented. In general, this post-processing 

10 combines the results of several localization measurements to increase the 
precision, to follow the sound source movements, or to track multiple sound 
sources. Various conventional techniques used for SSL post-processing include 
simple averaging, statistical processing, Kalman filtering, particle filtering, etc. 
Such techniques are typically application dependent, but are generally directed at 

15 removing localizations from reverberated waves and strong reflections, and to 
improve sound source localization precision. In general, as the precision of the 
localization estimates or measurements increases, any further processing of the 
audio signal (such as, for example, accurate sound source tracking) is enhanced. 

20 Signal source or object localization with respect to other signal types, 

including radio signals, radar waves, etc., is often accomplished using pre-and 
post-processing techniques similar to those described above for the case of 
sound waves captured via a microphone array. In general, such localization 
techniques often include beamsteering techniques adapted for different signal 

25 and receiver array types (e.g., directional antenna arrays, radar or laser receiver 
arrays, etc.). As with audio signals, localization of other signal types is typically 
based on analysis of the propagation of signals (e.g., sound waves, radio waves, 
radar wave reflections, etc.). 

30 With all such localization systems, regardless of signal or array type, one 

primary goal is to provide fast and precise localization estimates or 
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measurements, even in the presence of noise and other effects, such as 
diffraction, interference, reflection, etc., which tend to decrease localization 
precision and reliability. 

5 As noted above, post-processing of localization estimates is generally 

designed to increase the precision of localization estimates. Therefore, what is 
needed is a system and process for providing fast and reliable post-processing of 
localization data for improving the precision of localization estimates. Further, 
such a system and method should be operable with and adaptable for existing 

10 localization techniques. 

SUMMARY 

is A system and method as described herein operates to enhance existing 

signal localization techniques for tracking or locating one or more objects or 
signal sources by improving reliability and precision of initial localization 
estimates derived from conventional localization techniques. Note that 
throughout the following discussion, the term "object" will be used to refer either 

20 to actual objects being tracked or located via localization methods, or to signal 
sources (such as sound from a person talking). Conventional localization 
techniques that are improved through use of the post-processing techniques 
include, for example, conventional sound source localization (SSL) systems 
based on microphone array inputs, conventional radio source location systems 

25 based on directional antenna array inputs, conventional target location and 
tracking systems based on radar or laser receiver arrays, etc. Note that such 
localization techniques are well known to those skilled in the art, and will not be 
described in detail herein. 

30 In general, the post-processing system and method described herein 

applies statistical real-time clustering process to initial localization estimates, and 
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then uses this real-time clustering to generate new localization estimates having 
improved precision and reliability relative to the initial localization estimates. As 
noted above, the post-processing techniques described herein are adaptable for 
use with conventional systems that provide signal source localization estimates. 
5 Further, the system and method described herein has also been observed to 
provide improved precision and reliability where initial object localization 
estimates are gathered in environments which may include noise, reflections, 
reverberations, or other interference. 

In particular, the process described herein begins by collecting, 
generating, or otherwise acquiring initial object localization estimates or 
measurements using any of a number of conventional localization techniques. In 
general, conventional localization data is typically provided in one, two, or three 
dimensions (e.g., direction; direction and angle; or direction, angle and distance) 
as a function of time. While the post-processing techniques described herein 
may be generalized to localization data of any dimensionality, for purposes of 
explanation the following discussion will assume that the localization data is 
three-dimensional, i.e., direction, elevation and distance within a prescribed 
region, along with known or computed standard deviations for direction, elevation 
and distance, a^, a e% and a pi respectively. 

In addition to simply providing position estimates, conventional localization 
techniques frequently provide weight or confidence measures or estimates for 
describing a computed or estimated reliability for each localization estimate. 
25 Further, a time for each localization estimate is also typically provided. This time 
will be referred to hereinafter as a "time stamp" for each localization estimate. All 
of this information is used in the post-processing system and method described 
herein for increasing the precision and reliability of the initial localization 
estimates. 
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In other words, the post-processing system described herein takes an 
input of conventional localization estimates, with each localization estimate 
comprising: 1) positional data; 2) estimated positional reliability; and 3) a data 
time stamp. This data is then used to derive new localization estimates having 
5 improved reliability relative to the input localization estimates. This set of 

information (i.e., position, reliability, and time) for each initial localization estimate 
will be referred to throughout this description as "initial localization estimates," or 
simply as "initial measurements." 

10 Reliability of positional or localization estimates is assumed to decrease 

as time passes, especially where object motion, or receiving array motion, is 
possible. Therefore, in one embodiment, given the initial localization estimates, 
the post-processing system and method described herein begins by first 
discarding all localization measurements older than a predetermined time. This 

15 predetermined time, or measurement "lifetime," is the time during which any 
particular localization estimate will be considered valid, and therefore, during 
which the localization estimate will be used for post-processing computations. 
This "lifetime" is simply a measure of the time since a particular localization 
estimate was generated (i.e., the time between the time stamp of the particular 

20 localization estimate and the current time). 

In general, as the lifetime increases, more localization estimates will be 
available for post-processing. Typically, given a larger lifetime, the larger number 
of localization estimates available allows for a more reliable distinction to be 

25 made between real objects and possible reflections, thereby providing a better 
localization precision. However, use of a larger initial localization estimate 
lifetime also results in longer localization response times when an object moves 
or otherwise changes position relative to a receiving array. Therefore, optimal 
selection of localization estimate lifetime is made as a function of expected object 

30 motion. For example, given stationary or slow moving objects, longer lifetimes 
are typically appropriate. Conversely, shorter lifetimes are typically more 
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appropriate for faster object motions. Consequently, in one embodiment, an 
adaptive lifetime is computed based on computed object motions, with the 
lifetime being chosen as a tradeoff between response time and position validity 
as a function of time. The computed motions are simply computed using position 
5 estimates as a function of time. 

As described in further detail herein, the post-processing then continues 
with a multi-stage process, including: 1 ) "clustering" the initial localization 
estimates (those within the predetermined life-time) in spatially spread 

10 overlapping sections; 2) identifying "potential objects" within the clusters; 3) 
estimating the position and standard deviation of the potential objects based on 
the clustering; and 4) eliminating likely duplicate objects. In one embodiment, the 
post-processing further continues by computing a confidence level for the 
position estimate for each object (after elimination of duplicate objects). 

15 Consequently, the final result of the post-processing system and method 
described herein is a list of objects, each presented with location, standard 
deviation, and in one embodiment, the confidence level of the object location. 

In view of the above summary, it is clear that the post-processing 
20 techniques described herein provide an improved system and process for 
improving the precision of localization estimates derived from conventional 
localization techniques. In addition to the just described benefits, other 
advantages of this system and method will become apparent from the detailed 
description which follows hereinafter when taken in conjunction with the 
25 accompanying drawing figures. 
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DESCRIPTION OF THE DRAWINGS 



The specific features, aspects, and advantages of the present invention 
will become better understood with regard to the following description, appended 
5 claims, and accompanying drawings where: 

FIG. 1 is a general system diagram depicting a general-purpose 
computing device constituting an exemplary system for post-processing of initial 
localization data to generate new localization estimates having improved 
10 precision and reliability. 

FIG. 2 illustrates an exemplary system diagram showing exemplary 
program modules for implementing a post-processor for post-processing of initial 
localization data to generate new localization estimates having improved 
15 precision and reliability. 

FIG. 3 provides an exemplary operational flow diagram which illustrates 
the operation of the post-processor of FIG. 2. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 



In the following description of the preferred embodiments of the present 
invention, reference is made to the accompanying drawings, which form a part 
5 hereof, and in which is shown by way of illustration specific embodiments in 
which the invention may be practiced. It is understood that other embodiments 
may be utilized and structural changes may be made without departing from the 
scope of the present invention. 

10 1.0 Exemplary Operating Environment: 

Figure 1 illustrates an example of a suitable computing system 
environment 100 with which the invention may be implemented. The computing 
system environment 100 is only one example of a suitable computing 
15 environment and is not intended to suggest any limitation as to the scope of use 
or functionality of the invention. Neither should the computing environment 100 
be interpreted as having any dependency or requirement relating to any one or 
combination of components illustrated in the exemplary operating environment 
100. 

20 

The invention is operational with numerous other general purpose or 
special purpose computing system environments or configurations. Examples of 
well known computing systems, environments, and/or configurations that may be 
suitable for use with the invention include, but are not limited to, personal 

25 computers, server computers, hand-held, laptop or mobile computer or 
communications devices such as cell phones and PDA's, multiprocessor 
systems, microprocessor-based systems, set top boxes, programmable 
consumer electronics, network PCs, minicomputers, mainframe computers, 
distributed computing environments that include any of the above systems or 

30 devices, and the like. 
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The invention may be described in the general context of computer- 
executable instructions, such as program modules, being executed by a 
computer in combination with hardware modules, including components of a 
microphone array 198, or other receiver array (not shown), such as, for example, 

5 a directional radio antenna array, a radar receiver array, etc. Generally, program 
modules include routines, programs, objects, components, data structures, etc., 
that perform particular tasks or implement particular abstract data types. The 
invention may also be practiced in distributed computing environments where 
tasks are performed by remote processing devices that are linked through a 

10 communications network. In a distributed computing environment, program 
modules may be located in both local and remote computer storage media 
including memory storage devices. With reference to Figure 1 , an exemplary 
system for implementing the invention includes a general-purpose computing 
device in the form of a computer 110. 

15 

Components of computer 110 may include, but are not limited to, a 
processing unit 120, a system memory 130, and a system bus 121 that couples 
various system components including the system memory to the processing unit 
120. The system bus 121 may be any of several types of bus structures 

20 including a memory bus or memory controller, a peripheral bus, and a local bus 
using any of a variety of bus architectures. By way of example, and not 
limitation, such architectures include Industry Standard Architecture (ISA) bus, 
Micro Channel Architecture (MCA) bus, Enhanced ISA (EISA) bus, Video 
Electronics Standards Association (VESA) local bus, and Peripheral Component 

25 Interconnect (PCI) bus also known as Mezzanine bus. 

Computer 1 10 typically includes a variety of computer readable media. 
Computer readable media can be any available media that can be accessed by 
computer 110 and includes both volatile and nonvolatile media, removable and 
30 non-removable media. By way of example, and not limitation, computer readable 
media may comprise computer storage media and communication media. 
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Computer storage media includes volatile and nonvolatile removable and non- 
removable media implemented in any method or technology for storage of 
information such as computer readable instructions, data structures, program 
modules, or other data. 

5 

Computer storage media includes, but is not limited to, RAM, ROM, 
PROM, EPROM, EEPROM, flash memory, or other memory technology; CD- 
ROM, digital versatile disks (DVD), or other optical disk storage; magnetic 
cassettes, magnetic tape, magnetic disk storage, or other magnetic storage 

10 devices; or any other medium which can be used to store the desired information 
and which can be accessed by computer 110. Communication media typically 
embodies computer readable instructions, data structures, program modules or 
other data in a modulated data signal such as a carrier wave or other transport 
mechanism and includes any information delivery media. The term "modulated 

15 data signal" means a signal that has one or more of its characteristics set or 
changed in such a manner as to encode information in the signal. By way of 
example, and not limitation, communication media includes wired media such as 
a wired network or direct-wired connection, and wireless media such as acoustic, 
RF, infrared, and other wireless media. Combinations of any of the above should 

20 also be included within the scope of computer readable media. 

The system memory 130 includes computer storage media in the form of 
volatile and/or nonvolatile memory such as read only memory (ROM) 131 and 
random access memory (RAM) 132. A basic input/output system 133 (BIOS), 

25 containing the basic routines that help to transfer information between elements 
within computer 110, such as during start-up, is typically stored in ROM 131. 
RAM 132 typically contains data and/or program modules that are immediately 
accessible to and/or presently being operated on by processing unit 120. By way 
of example, and not limitation, Figure 1 illustrates operating system 134, 

30 application programs 135, other program modules 136, and program data 137. 
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The computer 110 may also include other removable/non-removable, 
volatile/nonvolatile computer storage media. By way of example only, Figure 1 
illustrates a hard disk drive 141 that reads from or writes to non-removable, 
nonvolatile magnetic media, a magnetic disk drive 151 that reads from or writes 

5 to a removable, nonvolatile magnetic disk 152, and an optical disk drive 155 that 
reads from or writes to a removable, nonvolatile optical disk 156 such as a CD 
ROM or other optical media. Other removable/non-removable, 
volatile/nonvolatile computer storage media that can be used in the exemplary 
operating environment include, but are not limited to, magnetic tape cassettes, 

10 flash memory cards, digital versatile disks, digital video tape, solid state RAM, 
solid state ROM, and the like. The hard disk drive 141 is typically connected to 
the system bus 121 through a non-removable memory interface such as interface 
140, and magnetic disk drive 151 and optical disk drive 155 are typically 
connected to the system bus 121 by a removable memory interface, such as 

15 interface 150. 

The drives and their associated computer storage media discussed above 
and illustrated in Figure 1, provide storage of computer readable instructions, 
data structures, program modules and other data for the computer 110. In Figure 

20 1, for example, hard disk drive 141 is illustrated as storing operating system 144, 
application programs 145, other program modules 146, and program data 147. 
Note that these components can either be the same as or different from 
operating system 134, application programs 135, other program modules 136, 
and program data 137. Operating system 144, application programs 145, other 

25 program modules 146, and program data 147 are given different numbers here to 
illustrate that, at a minimum, they are different copies. A user may enter 
commands and information into the computer 110 through input devices such as 
a keyboard 162 and pointing device 161, commonly referred to as a mouse, 
trackball, or touch pad. 
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Other input devices (not shown) may include a joystick, game pad, 
satellite dish, scanner, radio receiver, and a television or broadcast video 
receiver, or the like. Still further input devices (not shown) may include receiving 
arrays or signal input devices, such as, for example, a directional radio antenna 

5 array, a radar receiver array, etc. These and other input devices are often 
connected to the processing unit 120 through a wired or wireless user input 
interface 160 that is coupled to the system bus 121 , but may be connected by 
other conventional interface and bus structures, such as, for example, a parallel 
port, a game port, a universal serial bus (USB), an IEEE 1394 interface, a 

10 Bluetooth™ wireless interface, an IEEE 802.1 1 wireless interface, etc. Further, 
the computer 110 may also include a speech or audio input device, such as a 
microphone or a microphone array 198, as well as a loudspeaker 197 or other 
sound output device connected via an audio interface 199, again including 
conventional wired or wireless interfaces, such as, for example, parallel, serial, 

15 USB, IEEE 1394, Bluetooth™, etc. 



A monitor 191 or other type of display device is also connected to the 
system bus 121 via an interface, such as a video interface 190. In addition to the 
monitor, computers may also include other peripheral output devices such as a 
20 printer 196, which may be connected through an output peripheral interface 195. 

The computer 110 may operate in a networked environment using logical 
connections to one or more remote computers, such as a remote computer 180. 
The remote computer 180 may be a personal computer, a server, a router, a 

25 network PC, a peer device, or other common network node, and typically 

includes many or all of the elements described above relative to the computer 
110, although only a memory storage device 181 has been illustrated in Figure 1. 
The logical connections depicted in Figure 1 include a local area network (LAN) 
171 and a wide area network (WAN) 173, but may also include other networks. 

30 Such networking environments are commonplace in offices, enterprise-wide 
computer networks, intranets, and the Internet. 
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When used in a LAN networking environment, the computer 110 is 
connected to the LAN 171 through a network interface or adapter 170. When 
used in a WAN networking environment, the computer 110 typically includes a 
modem 172 or other means for establishing communications over the WAN 173, 

5 such as the Internet. The modem 172, which may be internal or external, may be 
connected to the system bus 121 via the user input interface 160, or other 
appropriate mechanism. In a networked environment, program modules 
depicted relative to the computer 1 10, or portions thereof, may be stored in the 
remote memory storage device. By way of example, and not limitation, Figure 1 

10 illustrates remote application programs 185 as residing on memory device 181. 
It will be appreciated that the network connections shown are exemplary and 
other means of establishing a communications link between the computers may 
be used. 

15 The exemplary operating environment having now been discussed, the 

remaining part of this description will be devoted to a discussion of a system and 
method for automatically improving the reliability and precision of object 
localization estimates derived from one or more receiving arrays. 

20 2.0 Introduction: 

A system and method as described herein operates to enhance existing 
signal localization techniques for tracking or locating one or more objects or 
signal sources by improving reliability and precision of initial localization 
25 estimates derived from conventional localization techniques. Note that 

throughout the following discussion, the term "object" will be used to refer either 
to actual objects being tracked or located via localization methods, or to signal 
sources (such as sound from a person talking, radio sources, radar reflections, 
etc.). Localization techniques that are improved through use of the post- 
30 processing techniques include, for example, conventional sound source 

localization (SSL) systems based on microphone array inputs, conventional radio 
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source location systems based on directional antenna array inputs, conventional 
target location and tracking systems based on radar or laser receiver arrays, etc. 
Note that such localization techniques are well known to those skilled in the art, 
and will not be described in detail herein. 

5 

In general, the post-processing system and method described herein 
applies statistical real-time clustering process to initial localization estimates, and 
then uses this real-time clustering to generate new localization estimates having 
improved precision and reliability relative to the initial localization estimates. As 

10 noted above, the post-processing techniques described herein are adaptable for 
use with conventional systems that provide signal source localization estimates. 
Further, the system and method described herein has also been observed to 
provide improved precision and reliability where initial object localization 
estimates are gathered in environments which may include noise, reflections, 

15 reverberations, or other interference. 

2.1 System Overview: 

The system and method described herein for post-processing of 
20 localization estimates begins by collecting, generating, or otherwise acquiring 
initial object localization estimates or measurements using any of a number of 
conventional localization techniques. In general, conventional localization data is 
typically provided in one, two, or three dimensions (e.g., direction; direction and 
angle; or direction, angle and distance) as a function of time. While the post- 
25 processing techniques described herein may be generalized to localization data 
of any dimensionality, for purposes of explanation the following discussion will 
assume that the localization data is three-dimensional, i.e., direction, elevation 
and distance within a prescribed region, along with known or computed standard 
deviations for direction, elevation and distance, a v , a e , and a , respectively. 

30 
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In addition to simply providing position estimates, conventional localization 
techniques frequently provide weight or confidence measures or estimates for 
describing a computed or estimated reliability for each localization estimate. 
Further, a time for each localization estimate is also typically provided. This time 
5 will be referred to hereinafter as a "time stamp" for each localization estimate. All 
of this information is used in the post-processing system and method described 
herein for increasing the precision and reliability of the initial localization 
estimates. 

10 In other words, the post-processing system described herein takes an 

input of conventional localization estimates, with each localization estimate 
comprising: 1) positional data; 2) estimated positional reliability; and 3) a data 
time stamp. This data is then used to derive new localization estimates having 
improved reliability relative to the input localization estimates. This set of 

15 information (i.e., position, reliability, and time) for each initial localization estimate 
will be referred to throughout this description as "initial localization estimates," or 
simply as "initial measurements." 

Reliability of positional or localization estimates is assumed to decrease 
20 as time passes, especially where object motion, or receiving array motion, is 
possible. Therefore, in one embodiment, given the initial localization estimates, 
the post-processing system and method described herein begins by first 
discarding all localization measurements older than a predetermined time. This 
predetermined time, or measurement "lifetime," is the time during which any 
25 particular localization estimate will be considered valid, and therefore, during 
which the localization estimate will be used for post-processing computations. 
This "lifetime" is simply a measure of the time since a particular localization 
estimate was generated (i.e., the time between the time stamp of the particular 
localization estimate and the current time). 
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In general, as the lifetime increases, more localization estimates will be 
available for post-processing. Typically, given a larger lifetime, the larger number 
of localization estimates available allows for a more reliable distinction to be 
made between real objects and possible reflections, thereby providing a better 

5 localization precision. However, use of a larger initial localization estimate 
lifetime also results in longer localization response times when an object moves 
or otherwise changes position relative to a receiving array. Therefore, optimal 
selection of localization estimate lifetime is made as a function of expected object 
motion. For example, given stationary or slow moving objects, longer lifetimes 

10 are typically appropriate. Conversely, shorter lifetimes are typically more 
appropriate for faster object motions. Consequently, in one embodiment, an 
adaptive lifetime is computed based on computed object motions, with the 
lifetime being chosen as a tradeoff between response time and position validity 
as a function of time. The computed motions are simply computed using position 

15 estimates as a function of time. 

As described in further detail herein, the post-processing then continues 
with a multi-stage process, including: 1) "clustering" the initial localization 
estimates (those within the predetermined life-time) in spatially spread 

20 overlapping sections; 2) identifying "potential objects" within the clusters; 3) 
estimating the position and standard deviation of the potential objects based on 
the clustering; and 4) eliminating likely duplicate objects. In one embodiment, the 
post-processing further continues by computing a confidence level for the 
position estimate for each object (after elimination of duplicate objects). 

25 Consequently, the final result of the post-processing system and method 
described herein is a list of objects, each presented with location, standard 
deviation, and in one embodiment, the confidence level of the object location. 
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2.2 System Architecture: 



The processes summarized above are illustrated by the general system 
diagram of FIG. 2. In particular, the system diagram of FIG. 2 illustrates the 

5 interrelationships between program modules for implementing a system and 
method for post-processing of initial localization estimates for improving the 
precision and reliability of those localization estimates. This system and method 
will generally be referred to below as a "post-processor." It should be noted that 
any boxes and interconnections between boxes that are represented by broken 

10 or dashed lines in FIG. 2 represent alternate embodiments of the post-processor 
described herein, and that any or all of these alternate embodiments, as 
described below, may be used in combination with other alternate embodiments 
that are described throughout this document. 

is In general, the post-processor operates to improve initial localization 

estimates 220 derived from a conventional receiver array 200 and a conventional 
initial localization estimate module 210. As noted above, the receiver array 200 
is any conventional receiving array, such as, for example, a microphone array, a 
directional antenna array, a radar array, a laser receiver array, etc. In addition, 

20 the receiver array 200 may also be a single receiver which acts in a manner 
similar to an array of receivers. For example, a single rotating receiver, such as 
a rotating radar receiver, will acquire signals from various directions about the 
axis of rotation. In this manner, the radar receiver acts in a manner similar to a 
radar array by receiving signals from various directions as it rotates. However, 

25 while localization estimates from such receivers are compatible with the post- 
processor described herein, it should be noted that single receiver systems (as 
opposed to actual arrays) tend to have longer response times for providing 
localization estimates. 

30 The initial localization estimate module 210 is a conventional localization 

or tracking system for providing localization estimates from signals or inputs 
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received from the receiver array 200. For example, in the case of a microphone 
array, the initial localization estimate module 210 is a conventional sound source 
localization (SSL) system. Note that for purposes of explanation, the following 
discussion will continue to use a conventional microphone array as the receiver 
5 array 200, and an SSL system as the initial localization estimate module 210. 
However, in view of the detailed description of the post-processor provided 
herein, it should be understood that the post-processor is operable with any of a 
number of conventional localization systems, and is not limited to merely 
improving the precision and reliability of localization estimates provided by a 
10 conventional microphone array/SSL system. 

Once the initial localization estimates 220 have been provided by the initial 
localization estimate module 210, the post-processor uses a clustering module 
250 for real-time clustering of the initial localization estimates 220 in spatially 

15 spread overlapping sections. Note that this real-time clustering process with 
spatially spread overlapping sections is described in detail below in Section 
3.2.1 . As noted above, in one embodiment, only those estimates whose lifetimes 
have not yet expired are processed via the real-time clustering process. 
Localization estimates whose lifetimes have expired are simply discarded with 

20 respect to the post-processing described herein. 

In one embodiment, a lifetime computation module 230 automatically 
computes optimized localization estimate lifetimes based on computed object 
motions. For example, because the initial localization estimates 220 include a 

25 time stamp indicating the time each localization estimate was generated, it is a 
simple matter to compute approximate object motions as a function of time. 
Given these object motions, the lifetime computation module 230 then 
determines an appropriate lifetime for localization estimates for each object. As 
noted above, these lifetimes are dependent upon the motion of particular objects. 

30 However, the lifetimes are also dependent upon the type of receiver array 200 
and localization system being used. 
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For example, in a tested embodiment, localization estimate lifetimes on 
the order of about 4 seconds was observed to provide good results for 
microphone arrays in combination with SSL for tracking persons speaking in the 
work volume covered by the microphone array. Similarly, in another tested 

5 embodiment, lifetimes on the order of about 2 seconds was observed to provide 
good results for identifying which directional antenna in an antenna array to use 
for receiving particular nodes in a wireless computer network. In each case, the 
lifetime computation module 230 then automatically adjusts the localization 
estimate lifetimes up or down by some small amount to account for computed 

10 object motions, with the goal being to provide a lifetime that ensures that the 
localization estimates are not so old as to be relatively unreliable. Further, in the 
case of multiple objects, each object may then have a different localization 
estimate lifetime, dependant upon the motion of each particular object. 

15 In another embodiment, rather than computing localization estimate 

lifetimes for particular systems, a lifetime input module 240 is used to provide a 
constant lifetime to be used for all initial localization estimates 220. Again, the 
length of the lifetime should be dependent upon the type of receiver array 200, 
the type of localization system being used, and expected object motions. 

20 

Once the clustering module 250 has completed the clustering of 
localization estimates, a potential object identification module 260 then identifies 
clusters of localization estimates representing potential objects within each 
spatially spread overlapping section through a statistical analysis of the 

25 localization estimate clusters. In general, potential objects are identified in these 
overlapping sections by using a threshold for identifying those sections which 
contain one or more objects. As described in detail below in Section 3.2.2, if a 
computed average weight of any section with a non zero number of 
measurements exceeds a predetermined threshold, than it is assumed that there 

30 is a potential object in that section, and it is selected for further processing. 
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A potential object localization module 265 then computes a position for 
each potential object from the localization estimates comprising each cluster. At 
this point, the position of potential objects represents a refinement of the initial 
localization estimates 220. However, as a result of using overlapping sections, 
5 particular localization estimates or measurements may exist in two or more 
neighboring sections. Note that the computation of potential object positions is 
described in further detail below in Section 3.2.3. 

Therefore, the next step uses a duplicate object elimination module 270 to 
eliminate duplications of identified objects resulting from the use of overlapping 
sections. In general, after the object identification module has analyzed the 
clusters of localization estimates, there is a list of hypothetical or potential 
objects, each represented by position, standard deviation and weight. However, 
because of the use of overlapping sections, there are often duplications in this 
list, hence the use of the term "potential objects." In general, as described in 
detail below in Section 3.2.4, duplications are deemed to exist if the distance 
between any two hypothetical objects is less than a threshold distance. Where 
duplicates are deemed to exist, the duplicate object elimination module simply 
removes the potential object having a lower weight from the list. In the unlikely 
case of equal weights, one of the potential objects is simply randomly eliminated. 
Any remaining potential objects are then simply provided as the new localization 
estimates 280. 

In another embodiment, a confidence level computation module 290 then 
25 computes a confidence level or measure for each of the new localization 

estimates 280. Computation of this confidence level is described below in further 
detail in Section 3.2.5. 

Finally, in yet another embodiment, the new localization estimates are 
30 provided back to the initial localization estimate module 210 for use in generating 
new initial localization estimates 220. The use of existing localization estimates 
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for initializing localization computations is well known to those skilled in the art, 
and will not be described in detail herein. 

3.0 Operation Overview: 

5 

The above-described program modules are employed for implementing 
the post-processor described herein. As summarized above, this post-processor 
system and method automatically improves the precision and reliability of 
localization estimates through multi-stage post-processing of input localization 
10 data. The following sections provide a detailed operational discussion of 
exemplary methods for implementing the aforementioned program modules. 

3.1 Collecting Initial Localization Results: 

is As noted above, the post-processor described herein is applicable to 

many different localization techniques. For example, the post processing system 
and method described herein operates to improve sound source localization 
(SSL) results in audio systems using microphone arrays. Similarly, the post 
processing system and method described herein also operates to improve 

20 localization of radio signals, such as, for example, to select particular directional 
antennas in an antenna array for use with computer endpoints in a wireless 
computer network. Other examples include radar and laser tracking systems. All 
such systems will be referred to collectively below as a "source localizer." 

25 Regardless of what type of localization system is being used, it is 

assumed that, in the case of three dimensional localization data, that each 
localization measurement includes: 1) an object location, represented by 
direction, elevation and distance; 2) a weight which provides an indication of how 
much the source localizer trusts this measurement; and 3) a time stamp which 

30 indicates the time of each localization measurement. 
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As noted above, the actual algorithm used by the localizer for generating 
the initial localization estimates doesn't really mater, as long as it provides the 
information noted above (i.e., object location, weight, and time). After each 
successful measurement the initial localization estimates are collected by a post- 

5 processor input queue. The post-processor then removes all measurements 
from the input queue that are older than a given lifetime T. As noted above, this 
lifetime is either automatically computed based on object motions, or predefined 
for particular localization systems. In general, using a larger lifetime means that 
more results are available for processing, thereby facilitating a more reliable 

10 distinction between real objects and reflections, and providing for better 

precision. However, as noted above, longer lifetimes result in larger response 
times when the object or signal source moves or changes position. 

3.2 Post-Processing of Initial Localization Estimates: 

15 

Post-processing of the initial localization estimates involves a number of 
stages, including: clustering, finding potential objects (i.e., sound sources, radio 
sources, etc.), estimating positions of the potential objects, reduction or 
elimination of duplicate objects, and calculation of the confidence levels. These 
20 post-processing stages are described in the following Sections. 

3.2.1 Clustering Initial Localization Estimates: 

It is assumed that the work volume (i.e., the space or volume being 
25 monitored or served by the receiving array) is pre-defined or can be readily 
determined using conventional techniques. In particular, it is assumed that the 
following parameters are known with respect to the work volume: 

• minimum and maximum direction angles ^ and tp m \ 

30 • minimum and maximum elevation angle 6^ n and 9 m ; and 

• minimum and maximum distance p. and p mf . 

r nun » max 
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Given these parameters, the work volume is automatically divided into a 
number, M, of overlapping regions or sections. Note that using regions of equal 
size is not necessary, but it serves to reduce computational complexity. Note 
that the section size depends on the precision of the initial estimator and 
5 shouldn't typically be larger than about four to six times the standard deviation, 
e.g., 6a. 



Alternately, in one embodiment, the number of overlapping regions or 
sections and amount of overlap is user definable. Any number or size of regions 

10 may be used. However, using more regions with smaller sizes tends to increase 
the precision of localization estimates, at the cost of increased computational 
overhead. In particular, the region size is the resolution of the post-processor to 
detect separate objects, assuming one object per cluster. The minimal size, 
however, depends on the precision of the initial estimator, where it is desired to 

15 have almost all measurements from one object to go to the same cluster. 
Therefore, for standard distributions this means that that a region size of 
approximately 6 times the standard deviation (e.g., 6a ) will capture about 99% of 
the measurements for the object. However, for particular purposes, the cluster or 
region size can be larger than 6a . 

20 

In a tested embodiment involving a microphone array and sound source 
localization, the work volume was divided into M overlapping regions, each with a 
size six times the standard deviation in each dimension (i.e., direction, elevation, 
and distance). Choosing areas of this size, and allowing a 50% overlap between 
25 regions, gives the number of overlapping regions, in accordance with Equation 
(1), where: 



ftf = g^nax ^min ^max ^min Pmax Pmin Equation (1 ) 

6cr 6a R 6cr ; 



30 
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Note that in this tested embodiment, the 6's in the denominator of 
Equation (1) represent the use of region sizes of six times the standard deviation 
of the initial estimator. As noted above, other region sizes may be used to 
provide for the desired accuracy. However, it should be noted that a 6-sigma 
interval (e.g., ±3-sigma around the average) holds approximately 99% of the 
measurements. In addition, the number "8" in Equation (1) is a consequence of 
the amount of overlap between regions. In particular, because a 50% overlap 
was used in this example, the factor of "8" is simply results from "2x2x2 = 8" 
(e.g., 50% overlap in each of the three dimensions). In view of the preceding 
discussion, it should be clear that more overlapping means more clusters, and 
thus more duplicated objects, and consequently more computational overhead. 
However, less overlapping means that objects in the middle between the clusters 
may be missed. For a 6-sigma section width, it has been observed that the 
optimal overlapping is about 66%. This means that the overlapping region is 4 
sigmas, which will hold approximately 80% of the measurements in the worst 
case. 50% overlapping was used in the tested embodiment simply to reduce 
computational overhead. 

Once the number of regions is known, the work volume is then simply 
divided into that number of regions. Each initial localization estimate is then 
simply assigned to whatever regions of the work volume cover each initial 
localization estimate. Because each initialization estimate has an associated 
lifetime, multiple localization estimates are typically available for each object. 
Consequently, assigning localization estimates to particular regions results in the 
formation of localization estimate clusters within each region. Note that due to 
the use of overlapping regions, particular measurement may be assigned to two 
or more neighboring sections, depending upon the amount of overlap. As noted 
above, the use of larger regions of overlap results in better resolution for 
separate objects but increases the necessary computations. 
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3.2.2 Identifying Sections with Potential Objects: 

Once the work volume has been divided into regions, and clusters of 
localization estimates formed as described above in Section 3.2.1, a 
determination is made as to which regions have potential objects within them. 
For example, if a particular region has no localization estimates, it is assumed 
that there are no objects within that particular region, so that region is then 
excluded from further processing so as to reduce computational overhead. 

However, in another embodiment, determination of whether particular 
regions include one or more potential objects is accomplished using a weight- 
based threshold. In particular, in a tested embodiment, identifying those sections 
with potential objects was accomplished by first computing an average weight, 
W lh , of all sections with a non-zero number of measurements (i.e., at least one 
localization estimate was assigned to that particular region), which represents the 
weight-based threshold. W lh then represents the threshold weight used to gauge 
each region of the work volume. Note that the threshold weight is computed 
using the original weights provided as part of the initial localization estimates. 
For example, one method for computing such weights is provided in Equation (2), 
as follows: 



where W 9 is the original weight ofy-th measurement in i-th section, N t is the 

number of measurements in i-th section, Mis the number of sections, L is the 
number of sections with a non-zero number of measurements, and K is a 
constant. 

Note that the value of K depends on the number of the sections and the 
number of false measurements (i.e., measurements in sections resulting from 




Equation (2) 
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reflections and noise rather than real objects). In a tested embodiment, a value 
of K=2 was used so to ignore as false measurements in as many sections as 
possible. In particular, it has been observed that sections with false 
measurements typically contain about one to two measurements, as a result of 
reflections and noise. Using a value of K=2 allows the post-processor to ignore 
sections with very small numbers of measurements so that such measurements 
are not counted as potential objects. Note that the value of a A" is application 
dependent, and for particular implementations of the post-processor, it may be 
desirable to avoid processing of sections with less than three measurements (or 
some other minimal number), depending on how many measurements are 
provided by the initial estimator as a function of time. 

A total section weight for each section which represents the sum of the 
weights of all measurements in each particular section is then computed for each 
section, as illustrated by Equation (3): 



If any section weight, W, , is greater than the threshold weight, W lh it is assumed 

that there is a potential object in that section, and it is flagged for further 
processing as described below. 

3.2.3 Position Estimation Computations: 

While position estimates can be computed for potential objects in all of the 
regions of the work volume, it is unlikely that there are actually any objects, or 
any localization estimates having high reliability, in regions that were not flagged 
in the previous step as having a weight exceeding the threshold, W lh . 
Consequently, in one embodiment, position estimates are only computed for 
potential objects within those regions having a weight which exceeds the 




Equation (3) 
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threshold, W A . In general, the position estimation provided during this stage of 
the post-processing involves a two-stage statistical processing approach. 

In particular, for each section, the first stage computes a weighted 
averaging of all measurements in the section. In one embodiment, the speed of 
this statistical processing is increased by first converting each position, /?.., to a 
rectangular coordinate system, as illustrated by Equation (4): 

x & = p s cos<p ij cos6 ij 

y g = py sin <p g cos 0 & Equation (4) 

z & = Pg cos0 9 

The weighted average and weighted standard deviation are then calculated as 
illustrated by Equations (5) and (6), as follows: 

1 ^ 

P' = wZ2 w vP(t Equation (5) 

VY i y=o 



a. = 



i 



w. 

Equation (6) 



where/?, is the position of the object in this section, and a. t is the standard 
deviation of the object position estimate. 

Due to the use of overlapping sections, and dependant upon the size of 
each sections, some of the localization measurements belonging to the same 
object might exist in two or more neighboring sections. Further, there may also 
be noise measurements existing in each such section. Therefore, in one 
embodiment, to eliminate these measurements, and to improve overall positional 



-28- 



precision, a second pass of weighted averaging is performed for measurements 
within a particular distance of the computed object position, p h 

In a tested embodiment, a weighted averaging of measurements in the 
5 range of p i ±2a i was used to obtain a final position estimate pjor each section. 

The number of measurements in the range of p i ±2a i is denoted as n r The 

weighted standard deviation a. for p. is then recalculated for precision 

estimation as illustrated by Equation (7), as follows: 



Finally the sum of the weights W. of all position measurements in the range of 
Pi±2a. is computed for each potential object, p r 

15 3.2.4 Reduction of Potential Objects: 

At this point a list of hypothetical or potential objects has been generated, 
with each of these potential objects being represented by position, p n standard 

deviation, 5 i , and weight, W r The goal of this step is to eliminate duplications 

20 from the list. As noted above, duplications may exist due to the use of 

overlapping sections. In one embodiment, the criterion for determining whether 
any two objects are duplicates is based on a computed distance between any 
two or more hypothetical or potential objects. For example, if the distance 
between any two hypothetical objects k and /, where (k±i ), in all directions is 

25 less than the amount of section overlap in each direction, then these two objects 
are considered as actually representing the same object or signal source. 
Assuming a six sigma section width and 50% section overlap, the distances are 
simply computed as illustrated by Equation (8), as follows: 



10 




(7) 



-29- 



I** — */| < 3o- p COS CT^ COS<7 fl 

\y k - y, I < 3o n sin a v cos a, (8) 
^-zJOcr, sin <r fi 

Once it is determined that any two potential or hypothetical objects are actually 
5 the same object, the object having a lower weight, W. t , is simply removed, 
thereby reducing the total list of objects. At this point the list of objects 
represents a set of object localization measurements or position estimates 
having improved precision and reliability with respect to the initial localization 
estimates. 

10 

3.2.5 Calculation of Position Estimate Confidence Levels: 

In one embodiment, the post-processing continues by computing 
confidence levels for each object position estimate generated during the post- 
15 processing stages described above. Typically, confidence levels are provided as 
a number in a range of 0 to 1 , with 0 representing no confidence, and 1 
representing full confidence. The factors affecting the computed confidence 
levels are the number of measurements , the standard deviation <?,. and the 
latest time stamp among these n. measurements. 

20 

In particular, when the number of measurements is less than a given 
number, N cri „ the confidence level decreases, thereby indicating less trustworthy 
position measurements. Note that the value of N cri , depends on the desired 
reaction time, and upon how many initial measurements are provided by the 
25 initial estimator. For example, in a tested embodiment of an SSL system, the 
desired reaction time was 0.5 seconds, and the initial estimator provided 
approximately 10 measurements per second. Consequently, at least five 
measurements, i.e., N crit = 5, were needed to have a confidence of 1. Increasing 
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Ncnt makes the confidence level slowly increase from the time someone begins to 
speak from a particular location. However, averaging and statistically processing 
less than about 5 to 10 measurements is not typically very reliable with in the 
type of SSL system used in this tested embodiment. 

5 

Similarly, position measurements having larger standard deviations are 
also typically less trustworthy. Finally, when particular position measurements 
are based on older data, those position estimates are also typically less 
trustworthy. For example in a tested embodiment, the confidence level was 
10 decreased for objects as the age of the most recent measurement approached 
the measurement lifetime (see Sections 2.1 and 2.2 for a discussion of 
measurement lifetimes). 



Therefore, in one embodiment, confidence levels were computed for each 
15 measurement by considering each of the above-mentioned factors. One such 
method for considering these factors to generate a position measurement 
confidence level is provided by Equation (9) and (10), as follows: 



20 



c ie = a J^e Equation (9) 

c iT ={t-T iLasl )l(2T) 



where c iN is the confidence level based on the number of measurements; c ilf) , c w 

and c ip are the confidence levels based on the standard deviation; and c„ is the 

confidence level based on the last measurement time stamp T iLast where t is the 

current time and T L is the measurements lifetime. After limiting, or clipping, the 

25 values of the confidence level sub-levels shown in Equation (9) in the range 
between 0 and 1 , the final confidence level is then calculated as illustrated by 
Equation (10), as follows: 
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C i — C iN C iip C i9 C ip C iT 



Equation (10) 



where the final confidence level, c„ will be between 0 and 1 by virtue of limiting 
5 each of the sub-confidence levels to this range. 

The computed confidence levels for each measurement are then included 
along with the final result from the post-processor (described in Sections 3.2.1 
through 3.2.4) to produce a list of objects, each presented with location, 
10 confidence level and standard deviation. As noted above, this list represents a 
refinement of the initial localization estimates and provides for more precise and 
reliable localization measurements or estimates relative to the initial localization 
estimates. 

15 4.0 Post-Processor Operational Overview: 

The processes described above with respect to FIG. 2, and in view of the 
detailed description provided in Sections 2 and 3 are illustrated by the general 
operational flow diagram of FIG. 3. In particular, FIG. 3 illustrates an exemplary 

20 operational flow diagram which illustrates operation of the post-processor. It 
should be noted that any boxes and interconnections between boxes that are 
represented by broken or dashed lines in FIG. 3 represent alternate 
embodiments of the post-processor described herein, and that any or all of these 
alternate embodiments, as described below, may be used in combination with 

25 other alternate embodiments that are described throughout this document. 

In general, as illustrated by FIG. 3, post-processor operation begins by 
accepting an input of localization estimates 220 generated 300 using a 
conventional localization technique for producing positional measurements from 
30 a receiving array 200. As described above, these conventional localization 
techniques are well known to those skilled in the art, and include techniques, 
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such as, for example, conventional SSL techniques for using a microphone array 
to localize sound sources within a work volume. 

The work volume is then divided into a number of overlapping regions or 
5 segments 310. As described above in Section 3.2.1 , in alternate embodiments 
the number of overlapping regions and amount of overlap used is either 
predefined, user definable, or automatically computed. 

Next, each initial localization estimate 220 is then simply assigned 320 to 
10 corresponding regions of the work volume based on the position of each of those 
initial localization estimates. However, as described above, multiple localization 
estimates are typically available for each object. As a result, particular 
localization estimates may actually be assigned to two or more regions due to the 
use of region overlap. Assigning 320 these initial localization estimates to the 
is various regions serves to form localization estimate clusters within one or more 
of the regions. 

A statistical analysis of the clusters in each region is then used to 
determine which of the regions contain potential objects 330. The point of this 
20 step is simply to avoid performing unnecessary computations for those regions 
that do not contain potential objects. Thus, while this step is not essential, it 
does serve to provide a reduction in computational overhead for the post- 
processor. 

25 Once the regions containing potential objects have been identified 330, 

the positions of each potential object are then estimated 340, again using a 
statistical analysis of the clusters representing each potential object within each 
region. These estimates positions are then used to determine whether any of the 
potential objects are duplicates. For example, because the regions overlap, and 

30 because particular measurements may therefore simultaneously exist in two or 
more regions, it is possible that particular objects can be identified as a potential 



-33- 



object in more than one region. This issue is addressed by simply comparing the 
actual positions and weights computed for each potential object to eliminate 350 
lower weight objects where a computed distance between potential objects 
indicates that two objects are actually the same object. 

5 

The remaining potential objects are then used to populate the list or set of 
new localization estimates 280, as described above. Further, in one 
embodiment, confidence levels are computed 360 for the position of each of the 
new localization estimates 280. In general, these confidence levels provide a 
10 measure of the reliability of the new localization estimates 280 based on a 

number of factors, including the number of measurements used in computing 

the new localization estimates 280, the computed standard deviation, & g , of each 

measurement, and the relative age of the data used to compute the new 
localization estimates 280. 

15 

Finally, in one embodiment, the new localization estimates 280 are 
provided as an input to the initial localization system which is used to generate 
300 the initial localization estimates. For example, as is well known to those 
skilled in the art, the use of current or prior position information is often used as 
20 an initialization factor when computing localization estimates based on new 
observations provided from a receiver array 200, such as a microphone array or 
other device. 

The foregoing description of the post-processor for automatically 
25 improving the precision and reliability of initial localization estimates has been 
presented for the purposes of illustration and description. It is not intended to be 
exhaustive or to limit the invention to the precise form disclosed. Many 
modifications and variations are possible in light of the above teaching. Further, 
it should be noted that any or all of the aforementioned alternate embodiments 
30 may be used in any combination desired to form additional hybrid embodiments 
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of the post-processor. It is intended that the scope of the invention be limited not 
by this detailed description, but rather by the claims appended hereto. 
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